A fuzzy approach to Markov decision processes with uncertain transition probabilities

نویسندگان

  • Masami Kurano
  • Masami Yasuda
  • Jun-ichi Nakagami
  • Yuji Yoshida
چکیده

In this paper, a Markov decision model with uncertain transition matrices, which allow a matrix to fluctuate at each step in time, is described by the use of fuzzy sets. We find a pareto optimal policy maximizing the infinite horizon fuzzy expected discounted reward over all stationary policies under some partial order. The pareto optimal policies are characterized by maximal solutions of an optimal inclusion including efficient set-functions. As a numerical example, the machine maintenance problem is considered.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reliability Assessment of Power Generation Systems in Presence of Wind Farms Using Fuzzy Logic Method

A wind farm is a collection of wind turbines built in an area to provide electricity. Wind power is a renewable energy resource and an alternative to non-renewable fossil fuels. In this paper impact of wind farms in power system reliability is investigate and a new procedure for reliability assessment of wind farms in HL1 level is proposed. In proposed procedure, application of Fuzzy – Markov f...

متن کامل

Loss Bounds for Uncertain Transition Probabilities in Markov Decision Processes

We analyze losses resulting from uncertain transition probabilities in Markov decision processes with bounded nonnegative rewards. We assume that policies are pre-computed using exact dynamic programming with the estimated transition probabilities, but the system evolves according to different, true transition probabilities. Our approach analyzes the growth of errors incurred by stepping backwa...

متن کامل

Diffusion Approximation for Bayesian Markov Chains

Given a Markov chain with uncertain transition probabilities modelled in a Bayesian way, we investigate a technique for analytically approximating the mean transition frequency counts over a finite horizon. Conventional techniques for addressing this problem either require the enumeration of a set of generalized process "hyperstates" whose cardinality grows exponentially with the terminal horiz...

متن کامل

A Markov Model for Performance Evaluation of Coal Handling Unit of a Thermal Power Plant

The present paper discusses the development of a Markov model for performance evaluation of coal handling unit of a thermal power plant using probabilistic approach. Coal handling unit ensures proper supply of coal for sound functioning of thermal Power Plant. In present paper, the coal handling unit consists of two subsystems with two possible states i.e. working and failed. Failure and repair...

متن کامل

A fuzzy treatment of uncertain Markov decision processes: Average case

In this paper, the uncertain transition matrices for inhomogeneous Markov decision processes are described by use of fuzzy sets. Introducing a ν-step contractive property, called a minorization condition, for the average case, we fined a Pareto optimal policy maximizing the average expected fuzzy rewards under some partial order. The Pareto optimal policies are characterized by maximal solution...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Fuzzy Sets and Systems

دوره 157  شماره 

صفحات  -

تاریخ انتشار 2006